Protein-mediated Loops and Phase Transition in Nonthermal Denaturation of DNA 
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We use a statistical mechanical model to study nonthermal denaturation of DNA in the presence 
of protein-mediated loops. We find that looping proteins which randomly link DNA bases located at 
a distance along the chain could cause a first-order phase transition. We estimate the denaturation 
transition time near the phase transition, which can be compared with experimental data. The model 
describes the formation of multiple loops via dynamical (fluctuational) linking between looping 
proteins, that is essential in many cellular biological processes. 
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Denaturation of DNA is a fundamental biological pro- 
cess before the transcription stage Thermal denatu- 
ration of DNA [2| has been modelled in many ways, in- 
cluding the ladder Q, Poland-Scheraga [H and Peyrard- 
Bishop [| models. The process still attracts attention of 
theoreticians in an attempt to describe it most efficiently 
@. Besides the melting, DNA also denatures under the 
influence of other factors such as pH value, salt concentra- 
tion, other chemical factors, and mechanical forces. One 
example of the latter is the DNA denaturation induced by 
an externally applied torque. The experiments with sin- 
le DNA molecules under torsional stress were reported in 
H that shed more light on the mechanical properties 
of DNA molecules in connection with their functioning in 
living cells. In vivo the torque is exerted by the RNA poly- 
merase that causes transcription-generated torsional stress 
@ (see also [T3| where a direct observation of DNA rotation 
during transcription by Escherichia coli RNA polymerase 
was reported) . A theoretical study of torque- induced DNA 
denaturation was presented by Cocco and Monasson 
and a thorough investigation of the effect of mechanical 
forces and torques on DNA and its denaturation was done 
by Marko [TJ]. 

Here we are interested in nonthermal denaturation of 
DNA that precedes the transcription process. Transcrip- 
tion regulation typically involves the binding of proteins 
over long distances on multiple DNA sites which are then 
brought close to each other to form DNA loops [l3(. The 
DNA loops can be formed by protein complexes, e.g., by 
the regulators of bacterial operons, such as ara, gal, and 
lac, and human proteins involved in cancer, such as retinoic 
X receptor. The presence of protein-mediated loops is 
also important for many other cellular processes, including 
DNA replication, recombination, and nucleosome position- 
ing as was extensively discussed in 

Recently Vilar and Saiz [l5| studied multiprotein DNA 
looping. They developed a model of formation of a sin- 
gle loop via connection of an arbitrary large number of 
proteins. Their model describes a switchlike transition be- 
tween looped and unlooped phases, and has been extended 
to account for multiple loops [14| . Dynamic protein- 
mediated loops within the framework of molecular sys- 
tems biology were considered in [r| for the cases of the 
lac operon and phage A induction switches. Here we con- 
sider a different model to describe the denaturation of 
DNA, which has loops formed by proteins that link bases 



randomly located along the molecular chain. Thus our 
model accounts for formation of multiple loops that is es- 
sential in cellular biological processes like pre-mRNA splic- 
ing [l[ . Yet another important feature of our model is that 
it presents a dynamic rather than static picture of forma- 
tion of loops as the protein-mediated links between the 
base pairs fluctuate, i.e. the proteins couple and decouple 
in the course of time. This demonstrates a connection be- 
tween formation of the structure of protein- mediated loops 
for the particular DNA-protein node-link interaction net- 
work and co-evolutionary complex networks flU fl?| . We 
show below that looping proteins can make the nonthermal 
denaturation process to be a first-order phase transition, 
ft is due to the effective long-range interactions by the me- 
diating proteins. We are primarily interested in the phase 
transition, in the metastability phenomenon that we have 
found and in the kinetics of the denaturation. We then 
calculate the transition time from the double-helix state 
to the coil state, which can be compared with experimen- 
tal data. 

The Model. Lattice models proved to be useful in stud- 
ies of the phenomenology of DNA denaturation [l8[ . Here 
we consider a simple statistical mechanical model defined 
on an one-dimensional lattice with each site corresponding 
to a rung of the ladder [3s] . A spin variable is associated 
with each site i where <7,- = — 1 when the corresponding 
ii-bond is intact and oi = +1 when it is broken. We as- 
sume an arbitrary folding of the DNA molecule so that any 
two base pairs may get connected via the looping proteins. 
The proposed model has the following Hamiltonian 
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where g > is the coupling parameter of nearest-neighbor 
interactions; Jij are the link variables, taking values and 
1 when the i and j nodes are uncoupled or coupled by 
the proteins, correspondingly; the absence or presence of 
proteins at site i is defined by the variable pi that takes 
values or 1, respectively; 7 > is the energy of interac- 
tion between the base pair sites coupled via an appeared 
link caused by the on-site proteins; h is the binding energy 



between base pairs that includes the energy of the hydro- 
gen bonds; e is the energy of binding of a protein at the 
site i; a is the energy of formation of a link connecting i 
and j sites; N is the number of base pairs. 

The first term assures that broken pairs tend to break 
pairs next to them and in the same way it makes to pair up 
bases next to paired ones. The second term describes cre- 
ation of links between proteins bound to bases at random 
sites of the molecule. These links form protein-mediated 
loops. The links actually fluctuate as the proteins at differ- 
ent sites may couple and decouple in the course of time. In 
general, 7 may depend on the length of the loop. However, 
such a dependence is a higher order effect and we do not 
consider that. In the third term the energy h depends on 
the external parameters that are determined by environ- 
mental conditions such as temperature T, pH value, salt 
concentration, and other chemical as well as mechanical 
factors. Change in h may cause openings and closings of 
base pairs. As an example we will consider its dependence 
on an externally applied torque. The energy h is a sum of 
contributions from the base pairing energy ho < and from 
the torsional energy h Tl associated with a change in the lo- 
cal twist, that is h = ho + h T , where h T = (l/2)C(Ao;) 2 , 
with C being the twisting elastic constant (torsional stiff- 
ness) and Alo = lu — luq being the deviation of the spatial 
angular frequency to (change of the rung angle around the 
axis per unit length along the chain) from its unstressed 
value uio [III ■ The torsional energy can also be represented 
via the torque r as h T = Tipo, where ipo = 27r/10.5 = 0.6 
radians per base pair (double helix contains about 10.5 
base pairs per helical turn). The fourth term is the energy 
of binding of proteins. The last (fifth) term is the energy 
of formation of a link between base pairs mediated by the 
looping proteins. We will use another parameter c defined 
via c/(N - c) = e~ Q/3 , where (3 is the inverse temperature. 
The ratio can be roughly treated as the probability of a link 
formation (see (l6j for more rigorous formulations and de- 
tails of a related model that describes a network of fluctu- 
ating links). We will assume sparse connectivity c/N <C 1 
with the number of looping proteins much less than the 
number of bases. The Hamiltonian may also include long- 
range direct H-bond interactions between open base pairs 
via a term proportional to Ylfj A%j (1 + <7j)(l + <jj) with an 
interaction matrix j4y . However we neglect these interac- 
tions assuming that their contribution is smaller compared 
to the interactions mediated by proteins [20( | . 

Effective Hamiltonian and Free Energy. The small 
number of proteins compared to the number of base pairs 
allows us to reduce dJ) to an effective mean-field type 
Hamiltonian. For that purpose we eliminate consequently 
Jij and Pi variables while calculating the partition function 
Z = Trj t p i(T e~P H , where the trace means summing up over 
the corresponding variables. Taking the trace over J^'s 



[16j | we arrive at the partition function Z 
with the following effective Hamiltonian 
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FIG. 1: Double helix fraction 6 = (1 — /Lt)/2 vs the base pair 
binding energy h for the parameters g = 8.5 kcal/mol, 7 = 0.02 
kcal/mol, c = 10, e = 7.2 kcal/mol and ksT = 0.6 kcal/mol. 
The first-order denaturation phase transition occurs at the crit- 
ical value h c = 0. The critical torque is r c = 1.6fcgT for AT-rich 
and t c = IksT for GC-rich chains. The double helix (coil) be- 
comes metastable for h > (h < 0) as indicated by dashes. 



Here 7' = (c/N) sinh/37 and A = (c/i\T) (cosh 07 - 1). The 
Hamiltonian ^ describes a system consisting of two in- 
teracting subsystems, DNA and proteins. Different time 
scales and different temperatures for two subsystems may 
lead to novel phenomena [22| . However, here we assume 
that DNA and proteins are in contact with the same heat 
bath at temperature T. 

For the case of strong binding energies e 3> A, 7', we can 
make a mean- field approximation and replace p^s by their 
mean values (p) — e^ £ /(l -I- e^ £ ), the proposed model is 
then reduced to the following effective Hamiltonian 
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where 7" = -| sinh/37 • ^ jj-gy; ) represents the effective 
coupling between base pairs mediated by proteins. Notice 
that we have neglected the effect caused by the presence 
of the persistence length Iq that would require to take into 
account only the terms for which one has \i — j\ > lo as the 
correction would be of order O(jfr) and would go to zero in 
the thermodynamic limit. The coupling in ((3|) is similar to 
that of a synchronization model with small world coupling 

To calculate the partition function Z cx Tr a e ^ H "ff for 
the Hamiltonian ([3]), we use the relationship X}j<j <J i (T j = 
hN, the Hubbard-Stratonovich transformation 

and the ex- 
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pression for the partition function of the one-dimensional 
(ID) Ising model [3[. Then the partition function takes the 
form Z oc d/ie~ l3N ^ ,1 ' > with the effective free energy 
f(/i) given by 
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FIG. 2: Free energy vs the order parameter /j, at the base pair 
binding energy value h — — 0.05kcal/mol for the parameters 
g = 8.5 kcal/mol, 7 = 0.02 kcal/mol, c = 10, e = 7.2 kcal/mol 
and fesT = 0.6 kcal/mol. Double helix is stable and coil is 
metastable for h < 0. 

+ sinh 2 f3(h + bfi) +e" 4 ^]. (4) 

Here 6 = c-sinh/37- f j^^; J and /i is the order parameter 

for the denaturation process. For the double helix state 
with all base pairs bound, ji = — 1; for the completely 
denaturated state, \i = 1. The values of /i, which determine 
the state of the molecule, are obtained via f'(fj) = that 
leads to the equation 

sinh/3(ft- + bu) 

M= y (5) 

ysinh 2 + tyi) + 

The model is an effective Ising model with ID nearest- 
neighbor and global (all-to-all) interactions. It can be 
shown that the model goes through a phase transition pro- 
vided j3be 2 ^ 9 > 1. That gives the necessary condition for 
the model parameters, e.g., the temperature. The suffi- 
cient condition for the phase transition would be the sign 
change of h. Thus h c = or r c = Zio/Vo defines the critical 
point for the first-order phase transition if necessary condi- 
tion f3be 2 " 9 > 1 is satisfied for the given parameters. The 
critical torque r c ranges from 1.6fcsT for weakly bound 
(AT-rich) sequences to 7fceT for the most strongly bound 
(GC-rich) sequences [zj j. 

In order to quantify the degree of denaturation we in- 
troduce the parameter 9 = (1 — n)/2 that is the fraction of 
bound base pairs. The parameter takes the value 9 = 1 for 
the double helix state and the value 9 — for the denat- 
urated coiled state. The dependence of the double helix 
fraction 9 on h is presented in Fig[TJ There is a metasta- 
bility in a range of the controlling external parameter h. 
This effect is illustrated in FigJ5]where the free energy with 
two minima is presented. Notice that there is no phase 
transition if the proteins do not interact (7 = 0) and thus 
the protein- mediated loops are absent. These are the loop- 
ing proteins which provide with the long-range interactions 
that make it possible to obtain a phase transition for the 
effectively ID lattice model. 



Transition time. The kinetics of the denaturation 
transition can be treated via the Langevin equation /i = 
— T 9 ^^ +£(t), where T defines the inverse relaxation time, 
is the while noise satisfying the relation (£(t)£(t')) = 
D6(t—t') with the diffusion coefficient D determined by the 
fluctuation-dissipation relation D = 2TksT. The corre- 
sponding Fokker-Planck equation (FPE) for the probabil- 
ity distribution function P(/j) of the order parameter /1 is 
P = £-A(v)P+±D£?P, where A(fi) = -1^. Making 

the transformations P — > Pe~&~ , D — ► DT and t — > t/T, 
we can rewrite the FPE as — P = HP with the Hamilto- 
man H = + + ±§ where $( M ) = -f'(fx). 

One can exactly solve the FPE to obtain P t (fi,/j,o) = 
\M^)\ 2 + e^^^nWnW, where ip (p) « 

e -/(M)/ 2fe s T an( j ^ i s the initial value. The decay rates 
A„ and the eigenfunctions ipn(fJ<) can be, in principle, de- 
rived exactly [251 ]. However, we are not considering here 
the dynamics of the probability distribution function. Our 
goal is to analyze the dependence of the transition rate 
on the model parameters, such as temperature T. There- 
fore we are only interested in the first eigenvalue given by 

Al ^ f y/f"(»rni n )\f"(jl m ax)\-e-iM> 1 ™* )~f{^ m )\ which 

governs the dynamics for long times. 

Let us consider the transition from the left minimum of 
the free energy in Fig. 2, corresponding to the native dou- 
ble helix state of DNA, to the right minimum representing 
denaturated state at the critical value h c . At this value, 
that corresponds to the first-order phase transition point, 
the free energy is a symmetric curve with two equal min- 
ima and the maximum located at /1 = 0. The transition 
time (the inverse transition frequency) is twice the 
time needed to achieve the top of barrier fx max from the 
minimum /i TO , n which is obtained from Ai 

n -t^^_e (6) 

max I 

This is a standard expression for the Kramers problem 
[26| . However there is a qualitative difference since the 
potential /(//) itself depends on temperature. The behav- 
ior of the denaturation transition time versus temperature 
drastically differs from the conventional Arrhenius case. 
Although the transition time first decreases at very low 
temperatures (frozen DNA) it begins to increase at high 
enough (physiological) temperatures. The reason is that 
the second derivative present in the denominator of Eq. ([6]) 
at the point Umax = diverges since /"(0) = 6(1 — (3be 2 ^ 9 ) 
and (3be 2l3g — > 1 (/"(0) — ► 0) at the critical temperature 
defined by the above mentioned necessary condition of de- 
naturation. For the set of parameters given in Figs. 1 
and 2, the transition time is = 2.35 • 10 _5 r^ 1 . The 
kinetics of pH-driven denaturation of DNA was studied 
experimentally in [271 ]. where the transition time for sin- 
gle molecule denaturation was estimated to be of order of 
1 -i- 10 seconds. Taken these values we come up with the 
inverse relaxation time V to be of order of 10~ 6 Hz. How- 
ever we believe that modern measurements in experiments 
with single molecules are needed to find precise values of 
the quantities. 

Discussion. In summary, we have introduced and stud- 
ied a model of nonthermal denaturation of DNA that can 



be induced by chemical factors, such as pH value or salt 
concentration, or by externally applied mechanical forces 
and torques (as an example we considered the case of 
torque-induced denaturation) in the presence of protein- 
mcdiatcd loops. The model accounts for proteins that bind 
to the DNA molecule. The bound proteins are then al- 
lowed to interact in a random way with each other thus 
creating the loops. We have found a first-order denatura- 
tion phase transition that is caused by the looping proteins, 
the proteins that connect base pairs that are at a distance 
along the chain. The model possesses a metastability re- 
gion provided that the necessary and sufficient conditions 
are satisfied. The kinetics of the denaturation phase tran- 
sition was described by a stochastic dynamics for the order 
parameter that is, in principle, exactly solvable. However 
we have been mainly interested in obtaining the transition 
rate in the vicinity of the first-order phase transition, ft 
has the standard form by Kramers with the associated po- 
tential being temperature-dependent. This leads to devia- 
tion from the Arrhenius law at physiological temperatures. 
In particular, the transition time becomes extremely large 
when the temperature approaches its critical value that 
is defined by the necessary condition for the denaturation 
phase transition. The DNA denaturation kinetics consid- 
ered here can be extended spatially to describe a front 



propagation process in the presence of protein-mediated 
loops and the noise that corresponds to the in vivo case. 

Finally, we have revealed a new purpose of the protein- 
mediated looping that is to facilitate in vivo denaturation 
of DNA needed to take it to the next transcription stage. 
The model also describes the formation of multiple loops 
via dynamical (fluctuational) linking between looping pro- 
teins, that is essential in cellular biological processes such 
as the pre-mRNA splicing [28] and the phenomenon of ge- 
nomic plasticity [29(. It can mimick, e.g., the revolution- 
ary networks of splicing cis-regulatory elements [30l | hav- 
ing the loops to splice out introns thus defining the exons 
within the DNA molecule. The presented theory can be ap- 
plied in studies of the above enumerated in vivo processes 
as well as for description of in vitro experiments with sin- 
gle DNA molecules. Yet another application of this or a 
generalized statistical mechanics model would be an inves- 
tigation of dynamic genome architecture in eukaryotic cells 
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